Stochastic Multiple Context-Free Grammar for RNA Pseudoknot Modeling
Authors
Abstract
Several grammars have been proposed for modeling RNA pseudoknotted structures. In this paper, we focus on multiple context-free grammars (MCFGs), which are a natural extension of context-free grammars and can represent pseudoknots, and extend a specific subclass of MCFGs to a probabilistic model called SMCFG. We present a polynomial-time parsing algorithm for finding the most probable derivation tree and an algorithm for estimating the probability parameters. Furthermore, we show some experimental results of pseudoknot prediction using the SMCFG algorithm.
Related resources
RNA pseudoknot modeling using intersections of stochastic context free grammars with applications to database search.
A model based on intersections of stochastic context free grammars is presented to allow for the modeling of RNA pseudoknot structures. The model runs relatively fast, having the same order running time as stochastic context free grammar parsers. The model is shown to be able to perform database searches and find RNA sequences which resemble RNA pseudoknots which bind biotin. The problem domain...
RNA Structure Prediction Including Pseudoknots Based on Stochastic Multiple Context-Free Grammar
Several grammars have been proposed for modeling RNA pseudoknotted structures. In this paper, we focus on multiple context-free grammars (MCFGs), which are a natural extension of context-free grammars and can represent pseudoknots, and extend a specific subclass of MCFGs to a probabilistic model called SMCFG. We present a polynomial-time parsing algorithm for finding the most probable derivation tr...
Soft Computing based Model for Identification of Pseudoknots in RNA Sequence using Learning Grammar
RNA structure prediction is one of the major topics in bioinformatics. Among the various RNA structures, pseudoknots are the most complex and distinctive. Various methods have been used for modeling RNA pseudoknotted secondary structure. In this paper, a new model for predicting RNA pseudoknot structure is proposed. In this model, features of two existing techniques, i.e. neural n...
On the Generative Power of Grammars for RNA Secondary Structure
Several grammars have been proposed for representing RNA secondary structure including pseudoknots such as simple linear tree adjoining grammar (sl-tag), extended sl-tag (esl-tag) and RNA pseudoknot grammar (rpg). The main purpose of this paper is to compare the generative power of these grammars by identifying them as subclasses of multiple context-free grammars (mcfg). Specifically, it is sho...
Small Subunit Ribosomal RNA Modeling Using Stochastic Context-Free Grammars
We introduce a model based on stochastic context-free grammars (SCFGs) that can construct small subunit ribosomal RNA (SSU rRNA) multiple alignments. The method takes into account both primary sequence and secondary structure basepairing interactions. We show that this method produces multiple alignments of quality close to hand edited ones and outperforms several other methods. We also introdu...